
Conversation

@lizexu123 (Collaborator) commented Jul 31, 2025

Support a user-supplied seed parameter.

Example usage:

1. Serving usage:

1.1 Random output on each request

```python
import openai

ip = "0.0.0.0"
service_http_port = "13188"  # port configured for the service

client = openai.Client(base_url=f"http://{ip}:{service_http_port}/v1", api_key="EMPTY_API_KEY")

response = client.chat.completions.create(
    model="default",
    messages=[
        {"role": "user", "content": "北京天安门在哪里?"},
    ],
    temperature=1,
    stream=False,
    seed=None,  # this line can also be omitted
)

print(response.choices[0].message.content)
print("\n")
```

Alternatively, with curl:

```bash
curl -X POST "http://10.54.104.207:13188/v1/chat/completions" -H "Content-Type: application/json" -d '{
  "messages": [
    {"role": "user", "content": "北京天安门在哪里?"}
  ]
}'
```

1.2 Fixed (deterministic) output

```python
import openai

ip = "0.0.0.0"
service_http_port = "13188"  # port configured for the service

client = openai.Client(base_url=f"http://{ip}:{service_http_port}/v1", api_key="EMPTY_API_KEY")

response = client.chat.completions.create(
    model="default",
    messages=[
        {"role": "user", "content": "北京天安门在哪里?"},
    ],
    temperature=1,
    stream=False,
    seed=1,
)

print(response.choices[0].message.content)
print("\n")
```

Alternatively, with curl:

```bash
curl -X POST "http://10.54.104.207:13188/v1/chat/completions" -H "Content-Type: application/json" -d '{
  "messages": [
    {"role": "user", "content": "北京天安门在哪里?"}
  ],
  "seed": 1
}'
```
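To sanity-check the deterministic path, a minimal sketch (assuming the service configured above is reachable at the same address) issues the same request twice with the same seed and compares the results:

```python
import openai

# Assumes the service from the examples above is running.
client = openai.Client(base_url="http://0.0.0.0:13188/v1", api_key="EMPTY_API_KEY")

def ask(seed):
    response = client.chat.completions.create(
        model="default",
        messages=[{"role": "user", "content": "北京天安门在哪里?"}],
        temperature=1,
        stream=False,
        seed=seed,
    )
    return response.choices[0].message.content

# With the same seed, two independent requests should return identical text.
assert ask(seed=1) == ask(seed=1)
```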

2. Offline usage

2.1 Random output

```python
from fastdeploy.engine.sampling_params import SamplingParams
from fastdeploy.entrypoints.llm import LLM

model_name_or_path = "Qwen/Qwen3-0.6B"

# Sampling hyperparameters
sampling_params = SamplingParams(temperature=0.1)
llm = LLM(model=model_name_or_path, tensor_parallel_size=1, reasoning_parser="qwen3")
prompt = "北京天安门在哪里?"
messages = [{"role": "user", "content": prompt}]
output = llm.chat([messages], sampling_params)

print(output)
```

2.2 Deterministic output

```python
from fastdeploy.engine.sampling_params import SamplingParams
from fastdeploy.entrypoints.llm import LLM

model_name_or_path = "Qwen/Qwen3-0.6B"

# Sampling hyperparameters: a fixed seed makes the output deterministic
sampling_params = SamplingParams(temperature=0.1, seed=1)
llm = LLM(model=model_name_or_path, tensor_parallel_size=1, reasoning_parser="qwen3")
prompt = "北京天安门在哪里?"
messages = [{"role": "user", "content": prompt}]
output = llm.chat([messages], sampling_params)

print(output)
```
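As a quick offline check (a sketch under the same setup as above), two runs with freshly constructed SamplingParams and the same seed are expected to produce the same completion:

```python
from fastdeploy.engine.sampling_params import SamplingParams
from fastdeploy.entrypoints.llm import LLM

llm = LLM(model="Qwen/Qwen3-0.6B", tensor_parallel_size=1, reasoning_parser="qwen3")
messages = [{"role": "user", "content": "北京天安门在哪里?"}]

# Build the sampling params fresh for each run to rule out shared state;
# with the same seed both completions should be identical.
out1 = llm.chat([messages], SamplingParams(temperature=0.1, seed=1))
out2 = llm.chat([messages], SamplingParams(temperature=0.1, seed=1))
print(out1)
print(out2)  # expected to match out1
```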


paddle-bot bot commented Jul 31, 2025

Thanks for your contribution!

@qingqing01 (Collaborator) left a comment

Please add unit tests to verify output stability with a fixed seed, and likewise add unit tests for the stability of sampling with a fixed seed.
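One possible shape for such a test (a hedged sketch, not the test actually added in this PR; the module name is hypothetical and it reuses the offline API from the description):

```python
# test_seed_stability.py  (hypothetical test module)
import pytest

from fastdeploy.engine.sampling_params import SamplingParams
from fastdeploy.entrypoints.llm import LLM

@pytest.fixture(scope="module")
def llm():
    # Model and parser follow the examples in the PR description.
    return LLM(model="Qwen/Qwen3-0.6B", tensor_parallel_size=1, reasoning_parser="qwen3")

def test_fixed_seed_is_stable(llm):
    messages = [{"role": "user", "content": "北京天安门在哪里?"}]
    # Comparing string forms is a stand-in; a real test would compare
    # the generated text field of the returned outputs.
    runs = [
        str(llm.chat([messages], SamplingParams(temperature=1.0, seed=1)))
        for _ in range(3)
    ]
    assert len(set(runs)) == 1  # fixed seed -> identical completions
```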
